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Through the classical umbral calculus, we provide a unifying syntax for single and multivariate k- 
statistics, polykays and multivariate polykays. From a combinatorial point of view, we revisit the 
theory as exposed by Stuart and Ord, taking into account the Doubilet approach to symmetric 
functions. Moreover, by using exponential polynomials rather than set partitions, we provide a 
new formula for fc-statistics that results in a very fast algorithm to generate such estimators. 

Keywords: cumulants; fc-statistics; polykays; symmetric polynomials; umbral calculus 

1. Introduction 

The theory of fc-statistics has a long history. Here, we outline the literature, beginning 
with Fisher (1929), who rediscovered the half-invariants theory of Thiclc (1897). He 
introduced fc-statistics (single and multivariate) as new symmetric functions of a random 
sample, aiming to estimate cumulants without using moment estimators. Dressel (1940) 
developed a theory of more general functions, later resumed by Tukey (1950), who named 
them polykays. Both Tukcy (1956) and Wishart (1952) developed methods to express 
polykays in terms of Fisher's fc-statistics. These methods are straightforward enough, but 
their execution leads to intricate computations and some cumbersome expressions, except 
in very simple cases. Later, many authors tried to simplify the matter. Kaplan (1952) 
resorted to tensor notation in order to simplify multivariate fc-statistics. Good (1975) gave 
an interpretation of cumulants as coefficients of the Fourier transform of the randomly 
ordered sample and used this formula in order to obtain expressions for single fc-statistics 
(1977). The whole subject was later described in great detail by Stuart and Ord (1987). 
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In the 1980's, tensor notation was employed by Speed (1983, 1986a, 1986b, 1986c), 
Speed and Silcock (1988a, 1988b) and extended to polykays and single fc-statistics. This 
extension reveals the coefficients defining polykays to be values of the Mobius function 
over the lattice of set partitions. As a consequence, Speed used the set theoretic approach 
to symmetric functions introduced by Doubilet (1972). In the same period, McCullagh 
(1984, 1987) simplified the tensor notation of Kaplan by introducing the notion of gen- 
eralized cumulants. Symbolic operators for expectation and the derivation of unbiased 
estimates for multiple sums were introduced by Andrews (2001), Andrews and Stafford 
(1998, 2000). Algorithms to compute fc-statistics and their generalizations were derived 
from such techniques. 

In 1994, Rota and Taylor brought new life to the celebrated umbral calculus, with the 
intention of restoring (within a modern formal setting) the computational power origi- 
nally dealt with in the writings of Blissard, Cayley and Sylvester. Their basic device was 
the representation of a unital sequence of numbers by a symbol, called the umbra, via an 
operator resembling the expectation operator of random variables. This shifts the atten- 
tion to the moment generating function rather than the probability generating one, with 
the latter usually linking the umbral calculus of Roman and Rota (1978) to probability 
theory (for a survey, see Di Bucchianico (1997)). The umbral calculus of Rota and Tay- 
lor, known as the classical umbral calculus, has been developed by Di Nardo and Scnato 
(2001, 2006a), paying particular attention to relations with probability theory. It has also 
been successfully applied in wavelet theory (Saliani and Senato (2006) and Shen (1999)). 

In this paper, we show how the classical umbral calculus provides a unifying framework 
for fc-statistics and their generalizations. Most of the results can be found in some form 
in the literature; nevertheless, we feel that the umbral approach unifies, generalizes and 
simplifies their presentation. In order to demonstrate the power of the umbral method- 
ologies, we conclude the paper by presenting an algorithm that generates fc-statistics in 
a very short computational time compared with existing procedures. 

Section 2 is aimed at readers unaware of the classical umbral calculus. Here, we re- 
call basic definitions and terminology. In Section 3, we consider symmetric polynomials 
in umbral frameworks and introduce some combinatorial tools. If the indetcrminatcs of 
symmetric polynomials are replaced by uncorrelatcd umbrae, surprisingly compact ex- 
pressions for both symmetric polynomials and relations between bases can be achieved. 
From a statistical point of view, this means obtaining elementary formulae connect- 
ing symmetric polynomials in the data points. Thus, the umbral approach recovers the 
methods exposed by Stuart and Ord, whose main limitation is the complexity of both 
the expressions and the procedures. In Section 4, the theory of fc-statistics and polykays 
is completely rewritten, stressing the power of the umbral methods by means of some ex- 
amples. In Section 5, we introduce the notion of umbra indexed by a multiset, in order to 
get umbral expressions for multivariate fc-statistics and multivariate polykays. Through 
this device, we represent multivariate moments and multivariate cumulants by umbrae. 
Moreover, we state identities involving multivariate moments and multivariate cumulants 
by simply characterizing a suitable multiset indexing. In the last section, we give a very 
fast algorithm to compute fc-statistics, originating from a new umbral expression for such 
estimators. Comparisons of computational times achieved by the Andrews and Stafford 
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algorithm, that of MATHSTATICA (Rose and Smith (2002)) and the proposed umbral 
algorithm end this paper. 

2. Background to classical umbral calculus 

This section reviews notation and terminology useful when handling umbrae. More details 
and technicalities can be found in Di Nardo and Senato (2001, 2006a). 
Formally, umbral calculus is a syntax consisting of the following data: 

(i) a set A = {a,/?, . . .}, called the alphabet, whose elements are named umbrae; 

(ii) a commutative integral domain R whose quotient field is of characteristic zero; 

(iii) a linear functional E, called the evaluation, defined on the polynomial ring R[A] 
and taking values in R such that 

• E[a l (3 3 ■ ■ -7 fe ] = Ela^Elfli] ■ ■ ■ E[y k ] for any set of distinct umbrae in A and for 
i,j,...,k non-negative integers {uncorrelation property); 

(iv) an element e € A, called an augmentation, such that -E[£ n ] = for every n > 1; 

(v) an element u 6 A, called a unity umbra, such that E[u n ] = 1 for every n > 1. 

An umbral polynomial is a polynomial p G The support of p is the set of all umbrae 

occurring in p. If p and q are two umbral polynomials, then 

(i) p and q are uncorrelated if and only if their supports are disjoint; 

(ii) p and q are umbrally equivalent iff E[p] = E[q] (in symbols, p ~ q). 

The moments of an umbra a are the elements a n £ R such that _E[a n ] = a n for n > and 
we say that the umbra a represents the sequence of moments 1, a±, ■ ■ ■ ■ 

It is possible that two distinct umbrae represent the same sequence of moments. In 
such a case, they are called similar umbrae. More formally, two umbrae a and 7 are said 
to be similar when 

E[a n ] = E[y n ] for every n > (in symbols, a = 7). 

Furthermore, given a sequence 1, a\, a%, . . . in R, there are infinitely many distinct, and 
thus similar, umbrae representing the sequence. 

The factorial moments of an umbra a are the elements a(„) £ R corresponding to 
umbral polynomials (a) n = a(a — 1) • • • (a — n + 1), n > 1, via the evaluation E, that is, 
E[(a) n ] =a (n) . 

Example 2.1 (Singleton umbra). The singleton umbra \ is the umbra whose mo- 
ments are all zero, except the first E[\] = 1. Its factorial moments arc X( n ) = ( — l)"~ 1 (n — 
1)!. 

Example 2.2 (Bell umbra). The Bell umbra (3 is the umbra whose factorial moments 
are all equal to 1, that is, E[{j3) n ] = 1 for every n > 1. Its moments are the Bell numbers. 
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Thanks to the notion of similar umbrae, it is possible to extend the alphabet A with 
the so-called auxiliary umbrae obtained via operations among similar umbrae. This leads 
to the construction of a saturated umbral calculus in which auxiliary umbrae are treated 
as elements of the alphabet (Rota and Taylor (1994)). Auxiliary umbrae come into play 
when dealing with products of moments. Note that aiOj ^ E[a % a?\, with a,; = -E[a z ] 
and a.,- = E[a J ], but aiaj = E[a l a' J ], with a = a' and a' uncorrelated with a. As a 
consequence, the umbra a + a' represents the sequence 



We may then denote the umbra similar to a + a' by the auxiliary umbra 2. a. In a 
saturated umbral calculus, the umbra 2. a is considered as an element of the alphabet A. 

In the following, we focus attention on auxiliary umbrae which will subsequently have 
a special role. Let {ai, c*2, ■ ■ ■ , a n } be a set of n uncorrelated umbrae similar to an umbra 
a. The symbol n.a denotes an auxiliary umbra similar to the sum a\ + ai + • • • + a n . 
The symbol a'"denotes an auxiliary umbra similar to the product a\ai • ■ ■ a n . Properties 
of these auxiliary umbrae are extensively described in Di Nardo and Senato (2001) and 
will be recalled whenever necessary. 

Remark 2.1. If n ^ m, then n.a is uncorrelated with m.a and a' n is uncorrelated with 
a' m . If p and q are correlated umbral polynomials, then n.p ^p\ + ■ • ■ + p n is correlated 
with n.q ~ q% H — • + q n , and pi is correlated with qi, but uncorrelated with qj if i ^ j. 

If the umbra a represents the sequence 1, 0,1,0,2, ... , then E[(a' n ) k ] = a% for non- 
negative integers k and n. 

In Di Nardo and Senato (2006a), the moments of n.a have been expressed in terms of 
moments of a, by means of the Bell exponential polynomials. Here, we adopt a different 
point of view referring to the notion of integer partitions, obviously connected to the Bell 
exponential polynomials through well-known relations. 

Recall that a partition of an integer i is a sequence A = (Ax, A2, . . . , At), where Aj are 
weakly decreasing integers and Ylj—i A? = *• The integers Aj are named parts of A. The 
length of A is the number of its parts and will be indicated by v\. A different notation is 
A = ( V 1 , 2 r2 ,...), where rj is the number of parts of A equal to j and r% + r% + • • • = v\ . 
We use the classical notation A h i to denote that A is a partition of i. Through the 
multinomial expansion theorem, powers of n.a can be seen to be umbrally equivalent to 
the umbral polynomials 





(2.1) 



where the sum is over all partitions of the integer i, (n) 



when v\ > n, 



dx 



1 



and Q A = (a, 1 )-'' 1 (a J 2 2 )- r2 ---, 



(2.2) 



ri!r 2 !--- (l!) ri (2!)^ ••• 
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with {ji} distinct integers chosen in {1,2, ...,n}= [n]. In particular, when evaluating 
the powers of n.a in (2.1), we have 

S[(n.a) J ]=^(n)„ A d A a A , (2.3) 

Ahi 

where a\ = a^a^ 2 

A feature of classical umbral calculus is the construction of new auxiliary umbrae by 
symbolic substitution. For example, if we replace the integer n in n.a with an umbra 7, 
then, from (2.1), the new auxiliary umbra 7.0; has powers 

( 7 .a) i ~^(7)^d A a A . (2.4) 

Ahi 

Equivalence (2.4) has been formally proven by using the notion of the generating function 
of an umbra; for further details, see Di Nardo and Senato (2001). 

In the dot product 7. a, replacing the umbra 7 with the umbra 7./?, we obtain the 
composition umbra of a and 7, that is, "f.(3.a. Its powers are 

{j.p.aY-Y.l^dxax. (2.5) 

Ahi 

The compositional inverse of an umbra a is the umbra a < ~ 1> such that 

a<- 1> ./3.a = a./3.a < - 1> = x . 

The compositional inverse of an umbra was introduced in Di Nardo and Senato (2001) 
in order to invert exponential power series. It has also been used to give a simple proof 
of the Lagrange inversion formula in umbral terms. Moreover, the compositional inverse 
of an umbra provides a link between the Bell and singleton umbra. For the purpose of 
this paper, we consider the compositional inverse of the unity umbra. 

In the following examples, powers of fundamental auxiliary umbrae are given via (2.4). 
Properties of such umbrae are described in Di Nardo and Senato (2001, 2006a). 

Example 2.3 (a-partition umbra). The umbra j3.a. with (3 the Bell umbra, is called 
the a-partition umbra. By virtue of (2.4), its powers are 

{f3.ay ~Y,dxa x (2.6) 

Ahi 

because the factorial moments of the Bell umbra are all equal to 1 (see Example 2.2). In 
particular, we have 

p.u<- l> = X , P.u = 0, P-X = u, (2.7) 
where u < ~ 1> denotes the compositional inverse of u. 
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Example 2-4 (a-cumulant umbra). The umbra %.a, with x the singleton umbra, is 
called the a-cumulant umbra. By virtue of (2.4), its powers are 

(X.af ~^x {vx) d x a x , (2.8) 

Ahi 

where x^ x ^ are the factorial moments of the umbra x ( see Example 2.1). In particular, 
it is possible to prove that 

X-(3 = u, x-X = m <_1> . 

Example 2.5 (a-factorial umbra). The umbra ot-x is called the a-factorial umbra 
and its moments are the factorial moments of a, that is, {a-xY — {ot)i- 

The disjoint sum of a and 7 is the umbra whose moments are the sum of nth moments 
of a and 7 respectively (Di Nardo and Senato (2006a)) — in symbols, 

(a + 7)™ ~ a" + 7™ for every n > 0. 

For instance, it is possible to prove 

X.a + x-7 = X-(" + 7), (2-9) 

. n 

the well-known additive property of cumulants. In the following, we denote by + i=1 ct% 
the disjoint sum of n umbrae and by + n a the disjoint sum of n times the umbra a. 



3. Symmetric polynomial umbrae 

We begin by recalling the definitions of the four classical bases of the algebra of symmetric 
polynomials in the variables Xi,X2, • • • ,x n . They are: 
elementary symmetric polynomials 

1<J1<J2<— <jfc<n 

power sum symmetric polynomials 

n 

s r ^ ' x^ , r 1,2,...; 

i=l 

monomial symmetric polynomials 

where the sum is over all distinct monomials having exponents Ai, . . . , A$; 
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complete homogeneous symmetric polynomials 

hi = y ^ mx = y ^ Xj 1 Xj 2 ■ ■ ■ Xj k . 

Xhi l<jl<j2<---<jk<n 

In the following, we replace the commutative integral domain R by K[x\, X2, ■ ■ ■ ,x n ], 
where K is a field of characteristic zero and x\,X2, ■ ■ ■ ,x n are variables. Therefore, the 
uncorrelation property (iii) of Section 2 must be rewritten as 

E[l] = 1; E[xiXj ■ ■ ■ a k (3 l ■■■]= x t x 3 ■ ■ ■ E[a k ]E[/3 l ] ■ ■ ■ 

for any set of distinct umbrae in A, for [n] and for non-negative integers k,l,.... 

In K[xi,X2, ■ ■ ■ ,x n ][A], an umbra is said to be a scalar umbra when its moments are 
elements of K, while it is said to be a polynomial umbra if its moments are polynomials 
of K[xi, X2, ■ ■ ■ , x n ]. A polynomial umbra is said to be symmetric when its moments are 
symmetric polynomials in K[x\ , X2, ■ ■ ■ , x n ] . 

A sequence of polynomials po,Pi, ■ ■ ■ S K[x±, X2, ■ ■ ■ , x n ] is umbrally represented by a 
polynomial umbra if po = 1 and p n is of degree n for every nonnegative integer n. The four 
classical bases of the algebra of symmetric polynomials are all represented by symmetric 
polynomial umbrae. In particular, we call the polynomial umbra e such that 



E\e k ] 



e k , fc = l,2,...,n, 
0, fc = ra+l,ra + 2,.. 



elementary polynomial umbra. Wc call the polynomial umbra a such that ^[c 7 ] = s r for 
every nonnegative integer r, power sum polynomial umbra. 

Proposition 3.1 (Elementary polynomial umbra). If \i, ■ ■ - ,Xn are n uncorrelated 
umbrae similar to the singleton umbra, then 

e^ (Xl * 1 + - fc +*"*" )fc , fc = l,2 (3.1) 

where e is the elementary polynomial umbra. 



Proof. For k = 1, . . . ,n, the result follows by applying the evaluation E to the multi- 
nomial expansion of (xi x i + ' ' ' + Xn x n) k ■ There are vanishing terms corresponding to 
the powers of x greater than 1. Only kl monomials of the form Xji x hXh x h "'Xjk x jk 
have a non-zero evaluation. Instead, for k = n + 1, n + 2, . . . , the evaluation E gives zero 
since at least one power of x greater than 1 occurs in each monomial of the multinomial 
expansion. □ 

Proposition 3.2 (Power sum polynomial umbra). If u is the unity umbra and a is 
the power sum polynomial umbra, then 



k-statistics, polykays and their multivariate generalizations 



447 



The following theorem gives an umbral relation between the elementary polynomial 
umbra and the power sum polynomial umbra. 

Theorem 3.1. If Xli ■ ■ ■ >X« are n uncorrelated umbrae similar to the singleton umbra 
X and a is the power sum polynomial umbra, then 

X-(XlXl-\ + XnXn) = (X-X)<7- (3-2) 

Proof. Equivalence (3.2) follows from (2.9), observing that 

. n . n 

X-(xiXi H 1" XnX n ) = + i= i X\Xi%i) =+i=i (x-X)Xi = (x-x)o-- (3.3) 

□ 

The next corollary points out a deeper meaning of Theorem 3.1, that is, by evaluat- 
ing the moments of the umbrae in (3.2), the well-known relations between power sum 
symmetric polynomials {s r } and elementary symmetric polynomials {e^} are recovered. 

Corollary 3.1. If {s r } are the power symmetric polynomials and {e^} are the elemen- 
tary symmetric polynomials, then 

(— lV _1 ^-^ e ri e 1 " 2 1 v-^ 

• S « = Z^ X (^)^!^T e l = -^Jd A (x (1)Sl ) ri (x (2) s 2 ) r2 ---. (3.4) 

AH AH 

Proof. From (3.1), we have 

E[(XlXl H 1" XnX n )\] = E[x\Xi H h XnXnY 1 E[(xiX\ H h XnX n ) 2 ] T2 ' " ' 

= (V.) ri e r 1 1 (2\) r2 e r 2 2 ••• 

and so 

e Tl e T2 

E[(x-{xixi + ■■■+ XnXn)Y] = i\ X M J~\ r\ " ' ' 

X\-i 

By Theorem 3.1, we have 

E[(X-(XlXl + ■■■ + XnXn)) 1 ] = £[(X-X) V] 

and the former identity in (3.4) follows, observing that 

E[(x.x) i o- i } = (-l) i - 1 (i-iy.s i . 
Taking the right dot product with the Bell umbra j3 in (3.2), we have 

P'[(X-X)°] = P-X-(XlXl + h XnXn) 
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and, by virtue of the third equivalence in (2.7), we have 

0-[(x-X)°] =XiXi H \-XnXn 

Equivalence (2.6) gives 



ACTA 



A hi 



The latter identity in (3.4) follows by observing that E[a\] = (si) ri (s2) T2 ■ ■ ■ and 

The umbral expression of to a requires the introduction of augmented monomial sym- 
metric polynomials rh\. Let A = (l ri ,2 r2 , . . .) be a partition of the integer i < n. Aug- 
mented monomial symmetric polynomials are defined as 



2 2 

jl#---#ir 1 #>i + 17 t ---#> 1 + r 2 ^--- 



The next proposition can be proven using the same approach as used in the proof of 
Proposition 3.1. 

Proposition 3.3. If xi, ■ ■ ■ ,Xn are n uncorrelated umbrae similar to the singleton um- 
bra, then 

™>\ - (Xi%i H 1" XnX n ) ri {xi%l H h XnX 2 n Y 2 ■■■■ (3.5) 

The next corollary follows by recalling that to a = rh\/ '(ri!^! • ■ ■)■ 
Corollary 3.2. 

(XiXi-\ ^XnX n ) ri (xia;?H ^XnX 2 n Y 2 

TO A ~ j j . 

ri! r 2 ! 

In order to characterize the symmetric polynomial umbra representing the complete 
homogeneous symmetric polynomials, we need to recall the notion of the inverse of an 
umbra. Two umbrae a and 7 are said to be inverse to each other when a + 7 = e. The 
inverse of the umbra a is denoted by — l.a. Note that, in dealing with a saturated umbral 
calculus, the inverse of an umbra is not unique, but any two inverse umbrae of the same 
umbra are similar. 

Proposition 3.4 (Complete homogeneous polynomial umbra). If xii ■ ■ ■ iXn are 

n uncorrelated umbrae similar to the singleton umbra, then 

, „ {-UXl(-Xl) + ■ ■ ■ + Xn(-X n )]Y 

hi~ , 1 = 1,2,.... (3.6) 
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Proof. From the multinomial expansion theorem, we have 
[-l.(-Xia;i)+--- + -l.(-X„a;„)] < 

-* ! E [ _1 -( _ Xji)]' ri ([-l-(-Xj 2 )] 2 )" r2 ' • • (i!)ri ri !(2!)?-2 r2 ! . . . ' 

where ji,j2, ■ ■ ■ are distinct integers chosen in [n]. As ([— 1.(— x)Y)' ri — (*Q r S the result 
follows from Corollary 3.2 and Proposition 3.3. □ 

Note that equivalences (3.1) and (3.6) are umbral versions of the well-known identities 

n 1 

E e ^ k = + E h >>* = fr (i- x -tv 

k i=l k ll*=lV 1 > 

Proposition 3.5. Ifxit ■ ■ -iXn o,re n uncorrelated umbrae similar to the singleton umbra 
X and a is the power sum polynomial umbra, then 

- X-[Xi(-Xi) + • ■ ■ + X»(^n)] = [-X-(-x)]c7- (3-7) 

Proof. Equivalence (3.7) follows by replacing the umbra x with ~x and the umbra 
u<- 1> ee x-X with (-w) < - 1> = (-x)-(-x) in (3.3). □ 

Equivalence (3.7) is an umbral version of the well-known relations between power sum 
symmetric polynomials {,s r } and complete homogeneous symmetric polynomials {hk}- 

3.1. Umbral symmetric polynomials 

Assume that we replace the variables xi,X2, ■ ■ ■ ,x n in the umbral polynomials Xi x i + 
• • • + Xn x n with n uncorrelated umbrae a\, a%, . . . , a n similar to an umbra a. Since 

Xi«i H hx„a„ = n.(xa), 

from Proposition 3.1, we have 



[n.{x<*)] k 



■ e k (ai,. . . ,a n ), 



where efc(ai, . . . ,a n ) are umbral elementary symmetric polynomials in if [A]. The same 
substitution in ux\ + ■ • ■ + ux n gives 

• n • f ' \k k k k 

+ i=1 uai =+ n a =>■ (+„ at) ~ a 1 H ha„=n.a 

so that n.a k = Sk(ati, . . . , a n ), where Sk(cti, . . . , a n ) are umbral power sum symmetric 
polynomials in if [A]. By using the latter identity in (3.4), we prove the following propo- 
sition. 
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Proposition 3.6. If x is the singleton umbra and a € A, then 

[n.( X a)] k ^dx(x-x)x(n.a) ri (™-« 2 P • ' ' • ( 3 - 8 ) 

Ahfc 

Due to (3.5), umbral augmented symmetric polynomials rh\ take the form 

rh x ~ [n.(xa)] Tl [n.(xa 2 )] T2 
Theorem 3.2. If X hi, then 

{n.(xa)P{n.(xa 2 )r--^(n)„ x a x . (3.9) 
(For the proof, see Di Nardo and Senato (2006b).) 

Statistically speaking, the last theorem states how to estimate products of moments 
by using only n sampled items. Moreover, by using Theorem 3.2 and (2.1), we are able 
to give a sort of inversion formula of equivalence (3.8), that is, 

(n.a) k ~ d x [n.(x<*)] ri Mxa 2 )P ■ • ■ • (3-10) 

Ahfc 

Equivalences (3.8) and (3.10) can be rewritten by using set partitions instead of integer 
partitions. Afterward, the use of set partitions allows a natural generalization of these 
equivalences. 

Let C be a subset of K\A] such that C = n. Recall that a partition it of C is a 
collection ir = {£>i, £?2, . . . ,Bk} with k < n disjoint and non-empty subsets of C whose 
union is C. We denote by IT n the set of all partitions of C. 

Let {a±, 0J2, • • • j ot n } be a set of n uncorrelated umbrae similar to an umbra a. We will 
denote by a ,7r the umbra 

where n = {B\, B2, ■ ■ ■ , Bk} is a partition of {ai, 012, ■ • • , ct n } and i±, 12, ■ ■ ■ , ik are distinct 
integers chosen in [n]. In particular, a ,7r = a\, where A is the partition of the integer n 
determined by tt. Indeed, a set partition is said to be of type A = {V 1 , 2 T2 , . . .) if there 
are r\ blocks of cardinality 1, T2 blocks of cardinality 2 and so on. The number of set 
partitions of type A is d\, as given in (2.2). 

Proposition 3.7. IfHk is the set of all partitions of [k] and a 6 A, then 

(n.a) k ~ Yl MX«)P MX« 2 )P • ■ •, (3-12) 

[n.( X a)] k * (x-x)'>-aP ("-« 2 ) r2 • • • • (3.13) 
7ren fc 
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Proof. Equivalence (3.12) follows directly from (3.10). From (3.11) we have (x-x)x = 
(x-x) ,7r - Hence equivalence (3.8) imples (3.13). □ 

In order to write products like (n.a) Tl (n.a 2 ) T2 ■ ■ ■ using a single symbol such as ct\ or 
a -7r , we need the notion of the multiset. This will be introduced in the next section. 

3.2. Some necessary combinatorics 

A multiset M is a pair (M, /), where M is a set, called the support of the multiset, and 
/ is a function from M to the non-negative integers. For each /x g M, f(/i) is called the 
multiplicity of /x. The length of the multiset (M, /), usually denoted by \M\, is the sum 
of multiplicities of all elements of M, that is, 

|M| =£_/(**)■ 

From now on, we denote a multiset (M,f) simply by M. A multiset Mi = (Mi,fi) is 
called a submultiset of M = (M, /) if M, C M and fi(fi) < /(/i) for every /i G M,. 

Let M be a multiset of umbral monomials. When the support of M is a finite set, say 
M = {/xi,/X2, . . . , /Ufc}, we will write 

]\ T — /..(/(Ml)) ..(/(Ma)) M — S it it a a \ 

M — {fa ,/x 2 , . . . ,/i fc j or M — | Ati i --;Mi i ■ ■ ■ , Vk,- - -,fJik t- 

f(t*i) f(p-k) 

Set 

When /(/x) = 1 for every £ M, the multiset M is simply a set -B of umbral monomials 
in K[A] and hence (3.14) becomes 

Ms = 11 M- 

If S = {ai, . . . , oti}, with uncorrelated umbrae similar to an umbra a, then olb = a'' B ' . 

The notation (3.14) can be easily extended to umbral polynomials and dot products 
as follows: 

p M = n p m > ( a -p)™ = n («-p) /(p) > mxp)u = n [ n -^p)\ Kp \ 

where p are umbral polynomials in K[A] and a <E A. For example, using this notation, 
equivalence (3.9) can be rewritten as 



(3.15) 



452 



E. Di Nardo, G. Guarino and D. Senate 



where P\ = {o^ ri \ a 2<J ~ 2 \ . . .}. To avoid misunderstandings, we will specify the multiset 
M where necessary, in order to know which umbrae occur in M . 

The notion we are going to introduce is quite natural and allows us to compress and 
simplify notation. 

Definition 3.1. A subdivision of a multiset M is a multiset S of k < \M\ non-empty 
submultisets Mi = (Mi, ft) of M such that 

(i) \jtiMi = M; 

( u ) Ei=i MaO = f(p) f° r ever y /" e M - 

We note that the notion of subdivision is different from that of multiset partition. 

Example 3.1. Let M = {/ij 2 ', fJ^\ 1^^}^ hence \M\ — 5. Subdivisions of M are 
{{^\^},{^},{^}} and {{^U^U^}}. 

Let S = (S,g) be a subdivision of the multiset M. Extending the notation (3.14), we 
set 



Mies 

and so 



Ms=I1^m Mi) (3-16) 



(n.n) 8 = II (n-^u) 9(M '\ [n.(xi*)]s= U Mx^)} 9 ^ ■ (3.17) 
M.es Aues 

Remark 3.1. A special case of (3.16) is the partition of a set. The notation (3.16) 
becomes 

Beir 

We may construct a subdivision of the multiset M in the following steps: assume that 
the elements of M are all distinct, build a set partition and then replace each element in 
any block with the original one. Thus, any set partition gives rise to a subdivision. More 
formally, consider a set [k] of k umbral polynomials. Define the function 

s:[k]^M (3.18) 

such that f(fJ-i) elements of [k] go in fii, /(/X2) elements of [k] go in fi2 and so on. Now 
consider a partition tt = {Bi, _B 2 , • ■ ■ , B m } of [k] into m blocks. Set 

(i) Mi = s(Bi)C_M; 

(ii) for any /x € Mi, /i(/x) = the number of p € Bi such that s(p) = fx. 
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The multiset SV, built with Mj = (Mj, /j), i = l,2,...,m, is the subdivision of M cor- 
responding to the partition 7r. Note that |7r| = |SV|- Moreover, it could be S Vl = S 1T2 for 

Remark 3.2. If M is a set, it is natural to define s as the identity function so that 
Sir = ft- If M is a subdivision, then s(p) is a multiset. 

We simplify equivalences (3.12) and (3.13) by means of the notion of subdivision. 

Proposition 3.8. If is the subdivision of the multiset M = {a^} corresponding to 
a partition -k £ n& , then 

[n.( X a)] k ~ J2 (x-xY*{n.a) s ^ (n.a) k ~ £ [n.( X a)] Sl . (3.19) 

nen k 7ren fc 

Proof. Subdivisions of M = {a^} are of the type 

S={{a},...,{a},{a^},...,{a^},...}, (3-20) 

with n + r 2 + • • • < k and rx + 2r 2 + ••• = &. Via the function s in (3.18), the multiset S 
corresponds to a partition n of [fc] with rx blocks of cardinality 1, r 2 blocks of cardinality 
2 and so on, so that 

(n.«K= [] («.a A / 1 ) 9(A/ ' ) = (n.a) ri (^-a 2 r 2 ---, 

by which the former equivalence in (3.19) is the result of (3.13). The latter equivalence 
follows from (3.12) by similar arguments. □ 

A natural extension of the notation to umbral polynomials leads to the following 
corollary. 

Corollary 3.3. If M is a multiset of umbral polynomials, then 

Mxp)]m - ^2 (x-x)' n (n-P)s„, (n.p) M - ^2 [n-(XP)]s„, (3.21) 
wen,, 7ren fc 

where k is the length of M and is the subdivision corresponding to the partition tt £ 11/.. 

4. fc-statistics and polykays 

If ai, a-i, . . . are moments of a random variable and K\, K2, ■ ■ ■ are its cumulants, then 

«i=X)(- i r" 1 (^-i)!<w (4- 1 ) 

AH 
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where A = (V 1 , T" 2 ,...), a\ = a^a^ 2 ■ ■ ■ and d\ is given in (2.2). If the umbra a represents 
the sequence 01, ■ ■ ■ , then the sequence of its cumulants k\, K2, ■ ■ ■ is represented by 
the a-cumulant umbra x- a i as follows by comparing (2.8) and (4.1) (for more details, see 
Di Nardo and Senato (2006a)). The ith fc-statistic fc, is the unique symmetric unbiased 
estimator of the cumulant Ki of a given statistical distribution, that is, 

E[ki] = Ki 

(see Stuart and Ord (1987)). fc-statistics are usually expressed in terms of sums of the 
rth powers of the data points 



In the following, we will give an umbral expression of fc-statistics by using umbral power 
sum symmetric polynomials in n uncorrelatcd and similar umbrae, that is, n.a r . 

Theorem 4.1 (fc-statistics). For i < n, we have 

{x . a y^ { -TY^d x {x-xY*{n.a) s „, (4.2) 
ah [n>u >- neu„ x 

where A = (l ri , 2 r2 , . . .) runs over all partitions of the integer i and is the subdivision 
of the multiset P\ = {a^ ri \ a 2 ^ T2 \ . . .} corresponding to the partition tt G II„ a . 

Proof. By replacing equivalence (3.15) in (2.8), we have 

AH {U) ^ 

where Pa = {cS Tl \ a 2 ',...}. Equivalence (4.2) is the result of the former in (3.21), 
where p has been replaced by a and the multiset M by P\ . □ 

Since E[(x-ct) 1 ] = Ki, equivalence (4.2) gives the umbral expression of the ith cumulant 
in terms of umbral power sum symmetric polynomials, that is, the zth fc-statistic. 

Example 4-1- The partitions of the integer 3 are {(l 3 ), (l 1 , 2 1 ), (3 1 )}, of length 3,2,1, 
respectively. Hence, 

for A= (l 3 ), ( 1, for A= (l 3 ), 

!(„„) = ^-1, for A=(l 1 ,2 1 ), d A = <3, for A = (l 1 , 2 1 ), 
for A= (3 1 ), [l, for A = (3 1 ). 
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From (4.2) and Table 1, we have 

2 3 1 

h = T~r( 2s 3 ~ 3s l s 2 + si) - ^-r- (-S 3 + SiS 2 ) + -«3 

(n) 3 (n) 2 n 

n 2 S3 — 3ns is 2 + 2sf 

= ' 

which is the well-known fe-statistic of order 3. 

Products of fc-statistics are known as polykays. Indeed, the symmetric statistic k r t 

such that 

^[&r,...,t] = K r ' ' ' K t 

(where n r , . . . , n t are cumulants) generalizes fc-statistics and these were originally called 
generalized k-statistics by Dressel (1940). Being a product of cumulants, the umbral 
expression of a polykay is simply 

k r ,..., t ~( X .ay---( X , .a') t , (4.3) 

with \t ■ ■ tX' being uncorrelated umbrae likewise a, . . . , a' satisfying a = ■ ■ ■ = a' . The 
following proposition provides the right-hand product of (4.3) in terms of umbral power 
sum symmetric polynomials. 

Theorem 4.2 (Polykays). If r + h t < n, then 

*,.,«= e (x - x) 7; (x - xr '' dA -'^ e (x*)->.«)*, 

(Ahr,...,„R) W»x+-+»r, *en„ x+ ... +Vrt 

where A = (l ri , 2 T2 , . . .) runs over all partitions of r, r\ = (l* 1 , 2* 2 , . . .) runs over all par- 
titions of t and S n is the subdivision of the multiset 

P A+ ... +J) = { a (-+-^),a 2(r2+ - +t2) ,...} 



Table 1. Formula (4.2) 



A 


Px 


tv e IF A 




tisv 


(X-X)'' r (w.a) Sii 


(I 3 ) 


{« (3) } 


{{Pl,P2,P3}} 


{{a, a, a}} 


1 


2(n.a 3 ) 






{{Pi},{P2,Pa}} 


{{a},{a,a}} 


3 


— (n.a)(n.a 2 ) 




{a<V 2<1) } 


{{Pl},{P2},{Ps}} 


{{a}, {a}, {a}} 


1 


(n.a) 3 


(1S2 1 ) 


{{a,a 2 }} 


{{a,a 2 }} 


1 


-(n.a 3 ) 






{{a}, {a 2 }} 


{{a}, {a 2 }} 


1 


in.a)(n.o?) 


(3 1 ) 


{a 3 } 


{{« 3 }} 


{{« 3 }} 


1 


(n.a 3 ) 
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Table 2. Formula (4.4) 



A + 77 


Px+v 


S rr 






(2 2 ) 


{a 2 , a 2 } 


{{a 2 , a 2 }} 


1 


-(n.a 4 ) 






{{a 2 }, {a 2 }} 
{{a, a, a 2 }} 


1 


(n.a 2 ) 2 


(1 2 ,2) 


{a, a, a 2 } 


1 


2(n.a 4 ) 






{{a,a},{a 2 }} 


1 


-( ? i.a 2 ) 2 






{{a,a 2 },{a}} 


2 


— (n.a 3 )(n.a) 
(n.a) 2 (n.a 2 ) 






{{a}, {a}, {a 2 }} 


1 


(I 4 ) 


{a, a, a, a} 


{{a, a, a, a}} 


1 


-6(n.a 4 ) 






{{a}, {a, a, a}} 


4 


2(n.a)(n.a 3 ) 






{{a, a}, {a, a}} 


3 


(n.a 2 ) 2 






{{a}, {a}, {a, a}} 


6 


— (n.a) 2 (n.a 2 ) 
(n.a) 4 






{{a}, {a}, {a}, {a}} 


1 



corresponding to the partition tt G n„ A _| |_„ . 

Proof. Note that 

a x ■ ■ ■ a' v ~ )- (ri+ - +tl) R)- (r2+ - +t2) • • " - «a + ...+„ 

where we have denoted by A + • • • + r\ the integer partition (l riH l_tl , 2 r2_l l "* 2 ,...). 
Replacing the right-hand product in (4.3) by the product of uncorrelated (2.8) and using 
(3.15), we have 

Equivalence (4.4) is the result of the former in (3.21), where p has been replaced by a 
and the multiset M by P\h i-t ^ 

Example 4-2. Assume r = f = 2. In order to express ^21 we need to consider the pairs 
of partitions {((2), (2)); ((l 2 ), (2)); ((2), (l 2 )); ((l 2 ), (l 2 ))}. Hence, 

f(n) a , for (A, 17) = ((2), (2)), 

< n ) 3 ' for(A,^) = ((l 2 ),(2)) and (A, 77) = ((2), (l 2 )), 
|» 4 , for (A, rj) = ((l 2 ), (l 2 )) 

and av (2) = l,^ 3 = — l,d(2) = <^(i 2 ) = 1- From Table 2, we have 

^ _-s 4 + s?, 2(2s 4 - s 2 - 2s 3 si + s 2 s 2 ) -6S4 + 8s 3 si + 3s| - 6s 2 s 2 + s 4 
(71)2 (71)3 (71)4 
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5. Multivariate moments and multivariate cumulants 



In this section, we define multivariate moments and multivariate cumulants of umbral 
monomials by means of technicalities introduced in Section 3.2. In the following. M = 
{n[ f{fll) \^ 2 f{fl2) \. . . , $ {Pr)) } denotes a multiset of length k. 

Definition 5.1. A multivariate moment is the element of K corresponding to the umbral 
monomial fj>M via the evaluation E , that is, 

E\fjL M ] = m tl ...t r > 

where U = f(/ii) for i = 1, 2, . . . ,r. 

For example, if M = /i 2 2 \ H^}, we have E[/j,m] = mm - When the umbral mono- 

mials fii are uncorrclated, m tl ...t r becomes the product of moments of /Xj. 

Definition 5.2. A multivariate cumulant is the element of K corresponding to 

E \{X-V)M]=Kt 1 ...t r , (5.1) 

where U = f{ni) for i = 1, 2, . . . ,r. 

Replacing the umbra \ in (5.1) by the unity umbra u, we get multivariate moments. 
Setting m = a for i = 1,2, . . . , r, we have M = {c^ fe )} and (x-u)m = (x- a ) k i that i s J the 
ordinary fcth cumulant. 

The notion of the generalized cumulant, introduced by McCullagh (1984), is translated 
into the umbral framework as follows: 

where it is a partition of a set of umbral monomials. For example, when 

7r = {{/ii, / u 2 },{M3},{A t 4,M5}} ) 

we have 

E[(xM = £[(x./iiM2)(x.M3)(x-/MM5)] - ^ 12 < 3 ' 45 , 

where 

,.12,3,45 ig 

the McCullagh's notation. In particular, when 7r = {{fii}, . . . , {fJ. r }\, we 
have (x-aOtt = (x-A*)[r] ; that is, the joint cumulant of /ii, . . . ,fx r . 

The next proposition allows us to express multivariate cumulants in terms of multi- 
variate moments. We only need an extension of (3.11), where the partition tt is replaced 
by a subdivision S. 

Assume S = {m[ 9[Mi) \ M 2 (s(M2)) , . . . , m\ 9(M])) ] to be a subdivision of the multiset M. 
Let 

^^( MMi )-^)... (/i ^). S (M,) ) (5 . 2) 
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where \im ± are uncorrelated umbral monomials. If M = {a^}, then subdivisions are of 
type (3.20) and cr s = a x , with A = (l ri ,2 r2 , . . .) h k. 

Proposition 5.1. If is the subdivision of the multiset M corresponding to the parti- 
tion 7r S Ilfc, then 

(X-M)M^ Y (X-X) M ^- (5.3) 

Proof. By using set partitions instead of integer partitions, equivalence (2.8) can be 
written as 

as has already been done for (3.12) and (3.13). We replace (x- a ) k °y (x-A*)jV) where 
N = {a'"}, and a ,s by ^ S,T , where S„ is the subdivision corresponding to the partition 
7r. We have 

Mn* Y (x-xt 1 ^ 

7ren fe 

and the result holds for a more general multiset M by using umbral substitutions. □ 

When M is a set of k different elements, Proposition 5.1 gives the following well- 
known relations among joint cumulants and multivariate moments of a random vector 
(X 1 ,...,X k ): 

K (x u . ..,x k )=Y (m - ljic-i) 1 ''- 1 n E { n x i ) • 

ireU k Ben \j£B / 

Equivalence (5.3) can be inverted in order to express multivariate moments in terms of 
multivariate cumulants. Indeed, as proven in Di Nardo and Senato (2006a), if k is the 
^-cumulant, then p, = f3.K, where (3 is the Bell umbra. In (5.3), replace the umbra \ by 
the umbra (3 and the umbral monomial [ij by a more general polynomial pj . Since the 
factorial moments of the Bell umbra are all equal to 1 (see Example 2.2), we have 

E[{fi. P ) M \ = Y^- 

Now replace the umbral polynomial p by the umbral polynomial = x-t 1 - From the 
third equivalence in (2.7), we have /3.X-A* = A 4 - This last equivalence allows to prove the 
following proposition. 
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Proposition 5.2. If is the subdivision of the multiset M corresponding to the parti- 
tion 7r G life, then 

Mm - E (x-Ai)' S,r - 

wen*. 

The next propositions and corollary are given by Speed (1983) and McCullagh (1984), 
using different methods and notation. 

Proposition 5.3. Assume the set of umbral monomials {fa, fai, ■ ■ ■ , fa} to be the union 
of two subsets , • ■ ■ , fa t } and {fa t i • • ■ > Mfc s }> with s + t = i, such that umbral mono- 
mials belonging to different subsets are uncorrelated. We have 

(X-I")[i] = (x-Mi)---(X-Mi)^0. (5.4) 

Proof. Let P = l l ji ano - Q = ELi ^fci ■ The polynomials P and Q are uncorrelated. 
By virtue of (2.9), we have 

x .(P + Q) = x-P + x-Q, 

that is, products involving powers of \.P and \.Q vanish, (5.4) being a special case. □ 

When the umbral monomials fa are interpreted as random variables, equivalence (5.4) 
states a well-known result: if some of the random variables are uncorrelated with all 
others, then their joint cumulant is zero. 

Corollary 5.1. If n is a partition of the set {fa, . . . ,fa}, then 

ren, 

T<7T 

Proof. Observe that any partition r satisfying r > tt, has at least one block, say B, 
that is the union of two or more blocks of the partition tt. By Proposition 5.3, we have 
E\(X'V>)b\ = since, in B, there is at least one umbral monomial uncorrelated with each 
other. □ 

Proposition 5.4. If tt is a partition of the set {fa, . . . , fa}, then 

(x-m)* - E (x-x) |p| ^ p . 

pen, 

p>7T 

Proof. Assume that the partition tt = {Si, B2, ■ ■ . , B^} has k < i blocks. From (5.3), we 
have 
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where pj = (j,b } for j = 1,2, ... , k. This result follows by observing that (x-p)[k] — ix-lA^ 
and that each partition r E Hk corresponds to a partition p 6 IT such that p > r and 
p- T ~p-P. □ 

Corollary 5.1 and Proposition 5.4 give the following result. 

Corollary 5.2. If n is a partition of the set {fix, . . . , /i;}, then 

p>TT T<p 

Following the same arguments used by McCullagh (1984), equivalence (5.5) can be 
rewritten as 



pen, 

pVTT = l 

where V means the least upper bound and 1 is the full set. 



6. Multivariate fc-statistics and multivariate polykays 

In order to introduce an umbral version of multivariate fc-statistics, wc must generalize 
the result of Theorem 3.2. 

Theorem 6.1. Let S = {m[ 9(Mi) \ M 2 (ff(Ma)) , . . . , m] 9(M,)) } be a subdivision of M . We 
have 

p- S ^-^—[n.{ X p)]s, (6.1) 

{n)\s\ 

where p, has been defined in (5.2). 
Proof. Observe that if M = {a^}, then 

where S is a subdivision of M of type (3.20). Equivalence (3.15) can then be rewritten 

as 

[n-(x<*)]s - ( n )\s\a' S 

and, by using umbral substitutions, the result holds for a more general multiset M. □ 
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Table 3. Formula (6.2) 



TV 


T 


S-r 


(n./i)s T 


{{pi,P2,Pa}} 


{Bi} 


{{/il,pl,/X 2 }} 


n.(nlfi 2 ) 
n.{n\fX2) 


{{Pl},{P2,P3}} 


{Bi,B 2 } 


{{/il,/LJl,/X2}} 




{{Bi},{B2}} 


{{Ml},{Ml I M2}} 


n.(/ii)n.(/ii/x 2 ) 
n.(/x5> 2 ) 


{{P'2},{PUPS}} 


{Bi,B 2 } 


{{Ml,A*l,M2}} 




{{Bi},{B a }} 


{{Mi}>{A*1.M2}} 


n.(ni)n.(fini 2 ) 
n.{n\^ 2 ) 


{{P3},{P1,P2}} 


{Bi,B 2 } 


{{M2,Ml,Ml}} 




{{B 1 },{B 2 }} 


{{M2},{Ml,Ml}} 


n.(/i 2 )n.(/x?) 
n.(/xiAt 2 ) 


{{pi},{P2},{Pa}} 


{B\,B 2 ,Bi\ 


{{/il,/LJl,/X2}} 




{{BiMBa.Bs}} 


{{Ml},{Ml,M2}} 


n.(ui)n.(unj, 2 ) 




{{^^{Bx.Bs}} 


{{Ml},{Ml,M2}} 


n.(ui)n.(unj, 2 ) 
n.(/x 2 )n.(/xf) 




{{S 3 },{-Bi,B2}} 


{{M2},{A*l,Ml}} 




{{SiM^M^}} 


{{Ml},{Ml},{M2}} 


n.(/i 2 )[n.(^ti)] 2 



Theorem 6.2 (Multivariate fc-statistics). If n> \M\ = k, then 

(X-m)m^ £ %3r^ E (x-x)- r ("-M)5 T , (6.2) 

7ren fc W M ren k | 

where S T is the subdivision of M corresponding to the partition r o/ i/ie sei iwi/i the 
blocks of 7r. 

Proof. By Theorem 6.1 and Proposition 5.1, we have 

(x-m)m- V 7^ [n-{xii>)}s*- 
Wis 

The result follows from the first part of (3.21) and by recalling that | -tt | = D 

It is interesting to note the similarity between expressions (4.2) and (6.2). On the 
right-hand side of (6.2), the set partition replaces the integer partition, the support of 
M having a cardinality greater than 1. 

Example 6.1. In order to express fei, take the multiset M = fJ.^} of length 

k = 3. Define the function s: [3] — > {/ii,/^} such that s(pi) = s(p2) = fJ-i and s(pa) = fi2- 
Let Sp. q ~ n.{pL\[i\). From Table 3, we have 

k 2 i ~ (x-Mi) 2 (x-M2) — ^^[« 2 S2.i - 2nsi. si.i - ns 2 .os ,i + 2sf s ,i]. 
("la 
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Multivariate polykays were introduced by Robson (1957). The symmetric statistic 
kt 1 ...t T ,....i 1 ...i m such that 

E[kt 1 ...t r ,...,h-lm] = K «l...tr - ' ■ K h---l m 

(where Kti...t r is a multivariate cumulant) generalizes polykays. Being a product of un- 
corrected multivariate cumulants. the umbral expression of a multivariate polykay is 
simply 

k tl ...t r ,...,h...i m - (x-/«)t • • • (x'-v')l, 
where x an d x' are uncorrelated likewise the umbral monomials /igT and /j'g! where 

T = {^\...,^},...,L={^\---^'i m) }- 
Let 5 be a subdivision of T and S* a subdivision of L. 

Theorem 6.3 (Multivariate polykays). For n > \T\ + ■ ■ ■ + \L\, we have 

k tl ...t r ,...,h-i m - 2^ (n) 2^ (X-X) ( n -P)s^ 

(7ren| T |,...,7r€n|i|) ren w+ ... +1S | 

(6.3) 

where S T is the subdivision of the multiset obtained by the disjoint union of T, . . . , L, 
with no uncorrelation labels and corresponding to the partition r of the set built with the 
blocks of {ir, ...,jt}. 

Proof. Observe that if S n is the subdivision of T corresponding to the partition tt £ IT| T | 
and Sft is the subdivision of L corresponding to the partition tt e then we have 

where we have denoted by SV + ■ ■ ■ + S# the subdivision obtained by placing side by side 
the blocks of subdivisions with no uncorrelated elements. By equivalences (5.3) and (6.1), 
we have 

where p is the generic element of + • • • + . The result follows by applying the first 
part of (3.21). □ 



Also in this case, note the similarity between expressions (4.4) and (6.3). The sum of 
subdivisions in the second case corresponds to the sum of integer partitions. 
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Table 4. Formula (6.3) 





T 




(n.p) s ; 


{{Ml»P2},{X}} 

{{Ati},iM2},{/i'i}} 


{Bi,b,} 

{{Bi},{Ba}} 

{{Bi,Ba,B 8 }} 

{{fli,B a },{Bs}} 

{{Bi.Bs},^}} 

{{B 2 ,B 3 },{Bi}} 

{{Bi},{S a },{B 3 }} 


{{//!, AJl,/Lt 2 }} 

{{Ml.M2},{Ml}} 

{{Ml.Ml,M2}} 

{{M1.Mi},{A«2}} 

{{Ml.M2},{Ml}} 

{Oil, Ate}, {/xi}} 
{{Ati},{A*2},{Ati}} 


n.(fj,iH2)n.fii 

n.(nln2) 

n.(/Lti)n./i2 

n.(/ii/i 2 )n./ii 

n.(/ii/i 2 )n./ii 

(n.fii) 2 n.fj,2 



Example 6.2. In order to express fcii,i, let T = {^i,/^},!* = {^il an d — ^(MiMiD 
From Table 4, we have 

— 7— T"[si,0Sl,l - s 2,l] _ T^T [ s l S 0,1 - 2si nSl.l + 2s 2 ,l - 52,0^0, l]- 

(n) 2 (n) 3 ' 



7. A fast algorithm for ^-statistics 

It is possible to build a very fast algorithm for fc-statistics by forfeiting the elegant idea 
of producing only one algorithm for the whole subject. We are going to prove that fc- 
statistics can be recovered through cumulants of compound Poisson random variables. 
This connection allows us to insert exponential polynomials in formula (4.2), eliminating 
set partitions. 

In order to construct such an algorithm, we need ratios of umbrae. Therefore, we 
introduce the notion of the multiplicative inverse of an umbra. Two umbrae are said to 
be multiplicatively inverse to each other when 

c*7 = u. (7.1) 

In dealing with a saturated umbral calculus, the multiplicative inverse of an umbra is not 
unique, but any two multiplicatively inverse umbrae of the umbra a are similar. From 
(7.1), we have 

Q-nffn = 1 for every n = 0, 1, 2, . . ., that is, g n = — . 

a n 

where a n and g n are moments of a and 7, respectively In the following, a multiplicative 
inverse of an umbra a will be denoted by I /a. 
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7.1. Cumulants of compound Poisson random variables 



The main result of this section is that cumulants of an umbra a can be expressed via 
cumulants of compound Poisson random variables. 

Let us consider the umbra x-V-P introduced in Di Nardo and Senato (2006a), where y 
is an indeterminate. As well as the umbra x-P nas moments all equal to 1, the moments 
of this umbra are all equal to y, as the following lemma states. 

Lemma 7.1. If x is the singleton umbra and (3 is the Bell umbra, then 



Proof. Observe that x-V-P = X-U-P- u = {x-V)-P- u : where the last equivalence follows 
from the associative law. So, by virtue of (2.5), we have 



On the other hand, (x-v) = XV an( i the only power of (x-y) different from zero is the one 
corresponding to v\ = 1 for which A = (i). Therefore, the sum in (7.3) reduces to y. □ 

As stated in Example 2.4, the umbra (x-y-(3).a = x-(y-P-a) is the cumulant umbra of a 
polynomial a-partition umbra and corresponds to a compound Poisson random variable 
of parameter y (Di Nardo and Senato (2001)). Such an umbra is the keystone for building 
cumulants. 

First, let us compute the moments of n.x-(y-ft-a), that is, the sum of n uncorrelated 
cumulant umbrae of a polynomial a-partition umbra. 

Proposition 7.1. The polynomial umbra n.%. {y.(3. a) has moments 



(x-y-PY - y, 



i = l,2,.... 



(7.2) 




(7.3) 




(7.4) 



with co(y) — 1 and Ci(y) of degree i for every integer i. 



Proof. Due to the associative law, we have n.x-{y-(3.ot) = (n.x-y)-(3.a so that 




(7.5) 



is the result of (2.5). Let Ci(y) = E[(n.x-y-l3.a) 1 ]. From (2.1), we have 
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Since 

(xy)> 



yi, forA = (F), 
0, otherwise, 



we have (n.x~yY x — (n) vx y l/x . Replacing this last equivalence in (7.5), the result fol- 
lows. □ 

The next theorem is the key to producing a fast algorithm for fc-statistics. It states 
that the a-cumulant umbra has moments umbrally equivalent to umbral polynomials 
obtained by replacing y with the umbra X-xl n -X m Ci(j/) , i = 1,2, ... . 

Theorem 7.1. If Cj(y) = E[(n.x-y-f3-ct) 1 }, then 

(X.aY-JZZ), i = l,2,.... (7.6) 
Proof. In (7.4), replace y by the umbra x-x/ n -X- We have 



C; 



AH 

Due to the uncorrelation property, we have 



X-X 

n 



E 



X-X 
n.x 



= 1^ wf( n )^«AaA- 



Recalling that £[(x.x)' yA ] = (-1)" A K-1)! and E[{n. X ) Vx ] = {n) Vx , the result follows. □ 

7.2. fc-statistics via compound Poisson random variables 

In the previous section, we have stated the connection between cumulants of an umbra 
and those of compound Poisson random variables. In order to recover the umbral expres- 
sions of /c-statistics given in (4.2), it is sufficient to express the moments of n-x-x.p.a 
in terms of power sums n.a 1 . This task has been partially accomplished in Section 3.1 
for the umbra n.(\ a )- We are going to extend such relations to a more general umbra 
71.(70). 

Lemma 7.2. In K[xi, X2, ■ ■ ■ , x n ] [A] , we have 

X-(7lXl H h JnX n ) = (x-tVi ( 7 - 8 ) 

where a is the power sum polynomial umbra and {7i}f— 1 are uncorrelated umbrae similar 
to an umbra 7. 
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Proof. Due to property (2.9), we have 

X.(7lXl H VlnXn) = +7=l\X-(li x i)] = +i=l(X-li)xi, 

where the last equivalence follows due to the fact that x-( c l) = c (x-l) f° r every c G K . 
Since the umbrae 7i are similar to the umbra 7, we have 

from which (7.8) follows. □ 

Corollary 7.1. If a is the power sum polynomial umbra and {7i}™ =1 are uncorrelated 
umbrae similar to the umbra 7, then 

(71X1 H h -f n x n y ~ ^ d\(x-l)\o-\- 

AN 

Proof. Taking the left dot product with (3 in (7.8) and recalling that (3.\ = u, we have 

(71X1 H h7„o; n ) =/3.[(x-7) CT l- ( 7 - 9 ) 

The result follows via (2.4). □ 

Replacing the indeterminate a;, by a,, the next theorem follows immediately. 

Theorem 7.2. J/ a, 76 A, i/ien 

[n.( 7 a)] i; ~ 2 d A ( X . 7 )A(n.a) ri (n.a 2 )^ • • • , 

AN 

where A = (l ri ,2 r2 , .. .) hi. 

Since x-D-P- a = (x-U-P) a (Di Nardo and Senato (2006a), formula (31)), Theorem 7.2 
allows us to express the polynomials Ci(y) in terms of power sums n.a 1 . This is the 
starting point to prove the following result. 

Theorem 7.3. Let 

71 

^(y) = Ey fc5 ( n ' fc )(- 1 ) fe_1 ( fc - 1 ) ! ' ( 7 - 10 ) 

k=l 

where S(n, k) are the Stirling numbers of second type. We have 

(X.ay ~ V d\p\ (n.ar (n.a 2 ) r > • ■ • , 

where X = (V 1 , 2 1 " 2 ,. . .) H and Px {y) = [pi(y)] ri (p 2 (y)P ■ • 
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Proof. From Theorem 7.2, we have 

d(y) s ^d A (x.X.2/./3)A(n.«) ri (n.a 2 p • • • , (7.11) 

Ahi 

with Ci(y) the ith moment of n.[(x-y-f3)a\. Note that x-X-U-P = U<1> V-P, so powers of 
X-X-Z/ are umbrally equivalent to the exponential umbral polynomials 

n 

(Di Nardo and Senato (2006a)) with 7 replaced by u < ~ 1> y, that is, 

n 

^( W < - 1> y) = ^/5(n,fc)( u <- 1 >) fc . 
fe=i 

Observing that p n (y) = £ , [^ n (it <_1> y)], the result follows from (7.11) since (x-X-V-P)^ — 

p\(y)- □ 

The MAPLE algorithm, which implements the result of Theorem 7.3, is the following: 
make Tab := proc(TV) 

[seq([An/mul((a;!)~numboccur(y, x) * numboccur(y, a;)!, 

x = {op(y)}) * mul(fc[i], i = y), mvl(S[i],i = y)], y = combinat 
['partition'] {N))}; end : 
makeK := proc(iV) 

[seq(fc[i] = add(combinat['stirling2'](i, j) * x^ * — 1) * (j — 1)!, 

j = l..i),i= l.JV)];end : 
fd := proc(j, h) 

expand(mul(n — t — h,t = 0..j — ft.)); end : 
kstat := proc(iV) 
locaki, v; 

v := expand(eval(makeTab(iV), makeK(iV))); 

u := [seq(x i = (-1)1* - 1) * (i - 1)! * fd(iV - 1, i), i = 1. JV)]; 

v := cxpand(cval(u, u)); 

l/mul((n — x), x = 0..N — 1) * add(x[l] * x[2], x = v); end : 



8. Concluding remarks 

Umbral formulae for fc-statistics and polykays, either in single or multivariate cases, 
share a common algorithm to construct multiset subdivisions. When the multiset has 
the form {cr*'}, this corresponds to computing partitions of the integer i, but integer 
partitions cannot be employed when the multiset has a more complex form. Even though 
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Table 5. Comparisons between computational times for single fc-statistics obtained by using 
the MATHEMATICA procedures of Andrews and Stafford, those of MATHSTATICA and the 
algorithm constructed via Theorem 7.3 (runs on PC 2.08 GHz, 512MB RAM) 



i 


Andrews and Stafford 


MATHSTATICA 


Fast umbral algorithm 


8 


0.72 


0.03 


0.00 


10 


3.49 


0.08 


0.00 


12 


24.80 


0.20 


0.00 


14 


396.34 


0.56 


0.05 


16 


58002.60 


1.69 


0.11 


18 




5.42 


0.20 


20 




19.11 


0.41 


22 




69.66 


0.81 


24 




285.58 


1.66 


26 




1551.48 


3.49 


28 




6324.28 


7.78 



the idea of constructing the function s is fundamental in constructing umbral formulae 
involving multisets, this is not efficient from a computational point of view. Indeed, ex- 
amples have shown how subdivisions may occur more than once in the same formula. 
We have therefore constructed an algorithm that generates only different subdivisions 
and enumerates how many times each subdivision occurs; see Di Nardo et al. (2008). 
Such an algorithm is the heart of the procedure Polykays which produces fc-statistics, 
polykays, multivariate fc-statistics and multivariate polykays using less computational 
time than those implemented by Andrews and Stafford (2000). For single and multi- 
variate fc-statistics Polykays has computational times comparable with MATHSTATICA 
(Rose and Smith (2002)) up to order 7 and a little worse for higher orders. Moreover, 
Polykays allows the computation of multivariate polykays, unlike MATHSTATICA. 

Finally, in Section 7 of this paper, we have proven that the umbral techniques not 
only provide a unifying structure for the whole subject, but also a new way of improving 
the computational generation of such estimators. An immediate example is the formula 
stated in Theorem 7.3, whose derived algorithm realizes the amazing computational times 
shown in Table 5. 
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